NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Integration of solar flare and coronal mass ejection event data

https://doi.org/10.1016/j.dib.2025.111539

Ji, Anli; Georgoulis, Manolis K; Aydin, Berkay (June 2025, Data in Brief)

Free, publicly-accessible full text available June 1, 2026
Hidden Activity Revealed: Photospheric Energetics and Dynamics with High-resolution Magnetographic Data

https://doi.org/10.3847/2041-8213/adf849

Georgoulis, Manolis K; Li, Qin; Lee, Jeongwoo; Wang, Haimin; Raouafi, Nour E (August 2025, The Astrophysical Journal Letters)

Abstract We revisit an existing but unexplored finding on the calculation of the baseline (i.e., potential) magnetic energy in observed solar magnetic configurations and apply it to two series of high-cadence, cospatial, and cotemporal line-of-sight photospheric magnetograms with a factor of ∼4 difference in spatial resolution. The target is a small coronal hole, ∼80^″across. We find significant differences between the two data sets, with approximate factors of 2.4 in the unsigned magnetic flux, 2.1 in the potential magnetic energy, and 5.2 in the mean amplitudes of the energy variation, all in favor of the higher-resolution magnetograms. Additionally, we find a factor of 2.5 difference in the characteristic magnetic flux replenishment time, with configurations at higher resolution renewing their flux every 46 minutes on average. Energy decreases associated with apparent magnetic flux cancellation events in higher resolution yield power densities above 10⁶erg cm⁻²s⁻¹, seemingly sufficient to sustain coronal holes and drive the fast solar wind. For the first time, this represents apparent energy released at photospheric altitudes rather than energy deposited via the Poynting flux. Lower-resolution magnetograms give 5.4 times less power density output. These intriguing results could have wide-ranging implications for in situ solar wind measurements and their solar sources in the Parker Solar Probe mission, as well as for high-resolution observations featuring simultaneous photospheric and chromospheric magnetograms including, but not limited to, data from the Daniel K. Inouye Solar Telescope.
more » « less
Free, publicly-accessible full text available August 22, 2026
Solar Alfvénic Pulses and Mesoscale Solar Wind

https://doi.org/10.3847/2041-8213/adeb54

Lee, Jeongwoo; Georgoulis, Manolis K; Sharma, Rahul; Raouafi, Nour E; Li, Qin; Wang, Haimin (July 2025, The Astrophysical Journal Letters)

Large-scale solar ejections are well understood, but the extent to which small-scale solar features directly influence the solar wind remains an open question, primarily due to the challenges of tracing these small-scale ejections and their impact. Here, we measure the fine-scale motions of network bright points along a coronal hole boundary in high-resolution Hαimages from the 1.6 m Goode Solar Telescope at Big Bear Solar Observatory to quantify the agitation of open flux tubes into generating Alfvénic pulses. We combine the motion, magnetic flux, and activity duration of the flux tubes to estimate the energy content carried by individual Alfvénic pulses, which is ∼10²⁵erg, adequately higher than the energies ∼10²³erg estimated for the magnetic switchbacks observed by the Parker Solar Probe (PSP). This implies the possibility that the surface-generated Alfvénic pulses could reach the solar wind with sufficient energy to generate switchbacks, even though some of then are expected to be reflected back in the stratified solar atmosphere. Alfvénic pulses further reproduce for the first time other properties of switchbacks, including the filling factor above ∼8% at granular and supergranular scales, which correspond best to the lower end of the mesoscale structure. This quantitative result for solar energy output in the form of Alfvénic pulses through magnetic funnels provides a crucial clue to the ongoing debate about the dynamic cycle of energy exchange between the Sun and the mesoscale solar wind that has been raised, but has not been adequately addressed, by PSP near-Sun observations.
more » « less
Free, publicly-accessible full text available July 16, 2026
Outlier Detection and Removal in Multivariate Time Series for a More Robust Machine Learning–based Solar Flare Prediction

https://doi.org/10.3847/1538-4365/adb9e3

Wen, Junzhi; Ahmadzadeh, Azim; Georgoulis, Manolis K; Sadykov, Viacheslav M; Angryk, Rafal A (April 2025, The Astrophysical Journal Supplement Series)

Abstract Timely and accurate prediction of solar flares is a crucial task due to the danger they pose to human life and infrastructure beyond Earth’s atmosphere. Although various machine learning algorithms have been employed to improve solar flare prediction, there has been limited focus on improving performance using outlier detection. In this study, we propose the use of a tree-based outlier detection algorithm, Isolation Forest (iForest), to identify multivariate time-series instances within the flare-forecasting benchmark data set, Space Weather Analytics for Solar Flares (SWAN-SF). By removing anomalous samples from the nonflaring class (N-class) data, we observe a significant improvement in both the true skill score and the updated Heidke skill score in two separate experiments. We focus on analyzing outliers detected by iForest at a 2.4% contamination rate, considered the most effective overall. Our analysis reveals a co-occurrence between the outliers we discovered and strong flares. Additionally, we investigated the similarity between the outliers and the strong-flare data and quantified it using Kullback–Leibler divergence. This analysis demonstrates a higher similarity between our outliers and strong-flare data when compared to the similarity between the outliers and the rest of the N-class data, supporting our rationale for using outlier detection to enhance SWAN-SF data for flare prediction. Furthermore, we explore a novel approach by treating our outliers as if they belong to flaring-class data in the training phase of our machine learning, resulting in further enhancements to our models’ performance.
more » « less
Free, publicly-accessible full text available April 1, 2026
Explainable Deep Learning-Based Solar Flare Prediction with Post Hoc Attention for Operational Forecasting

Pandey, Chetraj; Angryk, Rafal A; Georgoulis, Manolis K; Aydin, Berkay (October 2023, Springer Nature Switzerland)

Full Text Available
Towards coupling full-disk and active region-based flare prediction for operational space weather forecasting

https://doi.org/10.3389/fspas.2022.897301

Pandey, Chetraj; Ji, Anli; Angryk, Rafal A.; Georgoulis, Manolis K.; Aydin, Berkay (August 2022, Frontiers in Astronomy and Space Sciences)

Solar flare prediction is a central problem in space weather forecasting and has captivated the attention of a wide spectrum of researchers due to recent advances in both remote sensing as well as machine learning and deep learning approaches. The experimental findings based on both machine and deep learning models reveal significant performance improvements for task specific datasets. Along with building models, the practice of deploying such models to production environments under operational settings is a more complex and often time-consuming process which is often not addressed directly in research settings. We present a set of new heuristic approaches to train and deploy an operational solar flare prediction system for ≥M1.0-class flares with two prediction modes: full-disk and active region-based. In full-disk mode, predictions are performed on full-disk line-of-sight magnetograms using deep learning models whereas in active region-based models, predictions are issued for each active region individually using multivariate time series data instances. The outputs from individual active region forecasts and full-disk predictors are combined to a final full-disk prediction result with a meta-model. We utilized an equal weighted average ensemble of two base learners’ flare probabilities as our baseline meta learner and improved the capabilities of our two base learners by training a logistic regression model. The major findings of this study are: 1) We successfully coupled two heterogeneous flare prediction models trained with different datasets and model architecture to predict a full-disk flare probability for next 24 h, 2) Our proposed ensembling model, i.e., logistic regression, improves on the predictive performance of two base learners and the baseline meta learner measured in terms of two widely used metrics True Skill Statistic (TSS) and Heidke Skill Score (HSS), and 3) Our result analysis suggests that the logistic regression-based ensemble (Meta-FP) improves on the full-disk model (base learner) by ∼9% in terms TSS and ∼10% in terms of HSS. Similarly, it improves on the AR-based model (base learner) by ∼17% and ∼20% in terms of TSS and HSS respectively. Finally, when compared to the baseline meta model, it improves on TSS by ∼10% and HSS by ∼15%.
more » « less
Full Text Available
All-Clear Flare Prediction Using Interval-based Time Series Classifiers

https://doi.org/10.1109/BigData50022.2020.9377906

Ji, Anli; Aydin, Berkay; Georgoulis, Manolis K.; Angryk, Rafal (December 2020, 2020 IEEE International Conference on Big Data (Big Data))

Full Text Available
How to Train Your Flare Prediction Model: Revisiting Robust Sampling of Rare Events

https://doi.org/10.3847/1538-4365/abec88

Ahmadzadeh, Azim; Aydin, Berkay; Georgoulis, Manolis K.; Kempton, Dustin J.; Mahajan, Sushant S.; Angryk, Rafal A. (May 2021, The Astrophysical Journal Supplement Series)

Abstract We present a case study of solar flare forecasting by means of metadata feature time series, by treating it as a prominent class-imbalance and temporally coherent problem. Taking full advantage of pre-flare time series in solar active regions is made possible via the Space Weather Analytics for Solar Flares (SWAN-SF) benchmark data set, a partitioned collection of multivariate time series of active region properties comprising 4075 regions and spanning over 9 yr of the Solar Dynamics Observatory period of operations. We showcase the general concept of temporal coherence triggered by the demand of continuity in time series forecasting and show that lack of proper understanding of this effect may spuriously enhance models’ performance. We further address another well-known challenge in rare-event prediction, namely, the class-imbalance issue. The SWAN-SF is an appropriate data set for this, with a 60:1 imbalance ratio for GOES M- and X-class flares and an 800:1 imbalance ratio for X-class flares against flare-quiet instances. We revisit the main remedies for these challenges and present several experiments to illustrate the exact impact that each of these remedies may have on performance. Moreover, we acknowledge that some basic data manipulation tasks such as data normalization and cross validation may also impact the performance; we discuss these problems as well. In this framework we also review the primary advantages and disadvantages of using true skill statistic and Heidke skill score, two widely used performance verification metrics for the flare-forecasting task. In conclusion, we show and advocate for the benefits of time series versus point-in-time forecasting, provided that the above challenges are measurably and quantitatively addressed.
more » « less
Full Text Available
Prediction of solar energetic events impacting space weather conditions

https://doi.org/10.1016/j.asr.2024.02.030

Georgoulis, Manolis K; Yardley, Stephanie L; Guerra, Jordan A; Murray, Sophie A; Ahmadzadeh, Azim; Anastasiadis, Anastasios; Angryk, Rafal; Aydin, Berkay; Banerjee, Dipankar; Barnes, Graham; et al (February 2024, Advances in Space Research)

Aiming to assess the progress and current challenges on the formidable problem of the prediction of solar energetic events since the COSPAR/ International Living With a Star (ILWS) Roadmap paper of Schrijver et al. (2015) , we attempt an overview of the current status of global research efforts. By solar energetic events we refer to flares, coronal mass ejections (CMEs), and solar energetic particle (SEP) events. The emphasis, therefore, is on the prediction methods of solar flares and eruptions, as well as their associated SEP manifestations. This work complements the COSPAR International Space Weather Action Teams (ISWAT) review paper on the understanding of solar eruptions by Linton et al. (2023) (hereafter, ISWAT review papers are conventionally referred to as ’Cluster’ papers, given the ISWAT structure). Understanding solar flares and eruptions as instabilities occurring above the nominal background of solar activity is a core solar physics problem. We show that effectively predicting them stands on two pillars: physics and statistics. With statistical methods appearing at an increasing pace over the last 40 years, the last two decades have brought the critical realization that data science needs to be involved, as well, as volumes of diverse ground- and space-based data give rise to a Big Data landscape that cannot be handled, let alone processed, with conventional statistics. Dimensionality reduction in immense parameter spaces with the dual aim of both interpreting and forecasting solar energetic events has brought artificial intelligence (AI) methodologies, in variants of machine and deep learning, developed particularly for tackling Big Data problems. With interdisciplinarity firmly present, we outline an envisioned framework on which statistical and AI methodologies should be verified in terms of performance and validated against each other. We emphasize that a homogenized and streamlined method validation is another open challenge. The performance of the plethora of methods is typically far from perfect, with physical reasons to blame, besides practical shortcomings: imperfect data, data gaps and a lack of multiple, and meaningful, vantage points of solar observations. We briefly discuss these issues, too, that shape our desired short- and long-term objectives for an efficient future predictive capability. A central aim of this article is to trigger meaningful, targeted discussions that will compel the community to adopt standards for performance verification and validation, which could be maintained and enriched by institutions such as NASA’s Community Coordinated Modeling Center (CCMC) and the community-driven COSPAR/ISWAT initiative.
more » « less
Full Text Available
An Application of Spatio-temporal Co-occurrence Analyses for Integrating Solar Active Region Data from Multiple Reporting Modules

https://doi.org/10.1109/BigData47090.2019.9006185

Cai, Xumin; Aydin, Berkay; Georgoulis, Manolis K.; Angryk, Rafal (December 2019, 2019 IEEE International Conference on Big Data (Big Data))
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records